Adaptive headphone system

ABSTRACT

Various implementations include headphone systems configured to adapt to changes in user environment. In some particular cases, a headphone system includes at least one headphone including an acoustic transducer having a sound-radiating surface for providing an audio output; a sensor system configured to detect an environmental condition proximate the at least one headphone; and a control system coupled with the at least one headphone and the sensor system, the control system configured to: receive data about the environmental condition from the sensor system; and modify the audio output at the at least one headphone in response to a change in the environmental condition, wherein the audio output includes a continuous audio output provided across a transition between environmental conditions and is configured to vary with the change in the environmental condition.

PRIORITY CLAIM

This application claims priority to pending U.S. patent application Ser. No. 16/048,640 (Adaptive Headphone System), filed on Jul. 30, 2018, which itself claims priority to U.S. Provisional Patent Application 62/538,849 (Adaptive Headphone System), filed on Jul. 31, 2017, the entire contents of each of which are incorporated here by reference.

TECHNICAL FIELD

This disclosure generally relates to audio systems. More particularly, the disclosure relates to audio systems, such as headphones, including an adaptive module for modifying audio feeds across different environments.

BACKGROUND

Portable electronic devices, including headphones and other audio systems, are becoming more commonplace. However, the user experience with these audio systems is limited by the inability of these systems to adapt to different environments.

SUMMARY

All examples and features mentioned below can be combined in any technically possible way.

Various implementations include audio systems, such as headphone systems, configured to adapt audio output based upon particular environmental conditions. In other implementations, a computer-implemented method is disclosed for controlling an audio system, such as a headphone system.

In some particular aspects, a headphone system is disclosed including: at least one headphone including an acoustic transducer having a sound-radiating surface for providing an audio output; a sensor system configured to detect an environmental condition proximate the at least one headphone; and a control system coupled with the at least one headphone and the sensor system, the control system configured to: receive data about the environmental condition from the sensor system; and modify the audio output at the at least one headphone in response to a change in the environmental condition, where the audio output includes a continuous audio output provided across a transition between environmental conditions and is configured to vary with the change in the environmental condition.

In other particular aspects, a computer-implemented method of controlling a headphone system configured to provide an audio output is disclosed. In these cases, the method can include: receiving data indicating an environmental condition proximate the at least one headphone from a sensor system; and modifying the audio output at the at least one headphone system in response to a change in the environmental condition, where the audio output includes a continuous audio output provided across a transition between environmental conditions and is configured to vary with the change in the environmental condition.

Implementations may include one of the following features, or any combination thereof.

In some implementations, the audio output is at a decibel level ranging between approximately 50-70 decibels (dB). In certain cases, modifying the audio output includes decreasing or increasing the decibel level of the continuous audio output in response to the change in the environmental condition.

In certain implementations, the sensor system includes a position tracking system, and the environmental condition includes a location of the at least one headphone. In some cases, the location includes a proximity to a location of interest.

In certain cases, the sensor system includes at least one of an accelerometer or a gyroscope, and the environmental condition includes an acceleration of the at least one headphone or a deceleration of the at least one headphone. In some implementations, the control system is configured to increase a volume of the audio output or an intensity of the audio output in response to receiving data indicating the headphone is accelerating. In certain cases, the control system is configured to decrease a volume of the audio output or an intensity of the audio output in response to receiving data indicating the headphone is decelerating.

In some implementations, the control system is coupled with a smart device having access to a user profile or biometric information about a user, and the control system is configured to modify the audio output at the at least one headphone based upon the user profile or the biometric information about the user. In certain cases, the user profile includes settings for audio notifications at the at least one headphone, and the control system is configured to modify the audio output at the at least one headphone according to the settings.

In some cases, the sensor system includes a microphone, and the environmental condition includes an ambient audio signal. In certain implementations, the ambient audio signal includes a voice of a user, and the control system is configured to detect the voice of the user and modify the audio output in response to a voice command from the user. In some implementations, the control system is further configured to: analyze the voice command from the user for a speech pattern; and modify the audio output based upon the speech pattern in the voice command from the user.

In certain cases, the sensor system includes a wireless transceiver configured to detect an audio cache proximate the at least one headphone, and the control system is configured to provide a notification about the audio cache in response to the wireless transceiver detecting the audio cache. In particular implementations, the audio cache is stored in a local network at a geographic location or in a cloud storage system connected with the geographic location. In some cases, the audio cache includes a song, a pre-recorded message from a user or a pre-recorded message from an information source.

In certain implementations, the control system is further configured to provide a feedback prompt during the modifying of the audio output, the feedback prompt including an option to: refuse the modification of the audio output or alter the modification of the audio output.

Two or more features described in this disclosure, including those described in this summary section, may be combined to form implementations not specifically described herein.

The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features, objects and advantages will be apparent from the description and drawings, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram depicting an example personal audio device according to various disclosed implementations.

FIG. 2 shows a schematic data flow diagram illustrating a control process performed by an adaptive audio engine in the personal audio device of FIG. 1.

FIG. 3 shows a process flow diagram illustrating processes performed by the adaptive audio engine shown in FIG. 2.

FIG. 4 shows a schematic depiction of a user with the personal audio device of FIGS. 1 and 2 in a plurality of environments.

It is noted that the drawings of the various implementations are not necessarily to scale. The drawings are intended to depict only typical aspects of the disclosure, and therefore should not be considered as limiting the scope of the implementations. In the drawings, like numbering represents like elements between the drawings.

DETAILED DESCRIPTION

This disclosure is based, at least in part, on the realization that an audio control system can be beneficially incorporated into an audio system to provide for added functionality. For example, an audio control system can help to enable, among other things, adaptive audio based upon environmental changes and predictive playback functionality.

Commonly labeled components in the FIGURES are considered to be substantially equivalent components for the purposes of illustration, and redundant discussion of those components is omitted for clarity.

It has become commonplace for those who either listen to electronically provided audio (e.g., audio from an audio source such as a mobile phone, tablet, computer, CD player, radio or MP3 player), those who simply seek to be acoustically isolated from unwanted or possibly harmful sounds in a given environment, and those engaging in two-way communications to employ personal audio devices to perform these functions. For those who employ headphones or headset forms of personal audio devices to listen to electronically provided audio, it is commonplace for that audio to be provided with at least two audio channels (e.g., stereo audio with left and right channels) to be separately acoustically output with separate earpieces to each ear. For those simply seeking to be acoustically isolated from unwanted or possibly harmful sounds, it has become commonplace for acoustic isolation to be achieved through the use of active noise reduction (ANR) techniques based on the acoustic output of anti-noise sounds in addition to passive noise reduction (PNR) techniques based on sound absorbing and/or reflecting materials. Further, it is commonplace to combine ANR with other audio functions in headphones.

Aspects and implementations disclosed herein may be applicable to a wide variety of personal audio devices, such as a portable speaker, headphones, and wearable audio devices. Unless specified otherwise, the term headphone, as used in this document, includes various types of personal audio devices such as around-the-ear, over-the-ear and in-ear headsets, earphones, earbuds, hearing aids, or other wireless-enabled audio devices structured to be positioned near, around or within one or both ears of a user. Unless specified otherwise, the term wearable audio device, as used in this document, includes various types of personal audio devices such as shoulder or body-worn acoustic devices that include one or more acoustic drivers to produce sound without contacting the ears of a user. It should be noted that although specific implementations of personal audio devices primarily serving the purpose of acoustically outputting audio are presented with some degree of detail, such presentations of specific implementations are intended to facilitate understanding through provision of examples, and should not be taken as limiting either the scope of disclosure or the scope of claim coverage.

Aspects and implementations disclosed herein may be applicable to personal audio devices that either do or do not support two-way communications, and either do or do not support active noise reduction (ANR). For personal audio devices that do support either two-way communications or ANR, it is intended that what is disclosed and claimed herein is applicable to a personal audio device incorporating one or more microphones disposed on a portion of the personal audio device that remains outside an ear when in use (e.g., feedforward microphones), on a portion that is inserted into a portion of an ear when in use (e.g., feedback microphones), or disposed on both of such portions. Still other implementations of personal audio devices to which what is disclosed and what is claimed herein is applicable will be apparent to those skilled in the art.

FIG. 1 is a block diagram of an example of a personal audio device 10 having two earpieces 12A and 12B, each configured to direct sound towards an ear of a user. Reference numbers appended with an “A” or a “B” indicate a correspondence of the identified feature with a particular one of the earpieces 12 (e.g., a left earpiece 12A and a right earpiece 12B). Each earpiece 12 includes a casing 14 that defines a cavity 16. In some examples, one or more internal microphones (inner microphone) 18 may be disposed within cavity 16. An ear coupling 20 (e.g., an ear tip or ear cushion) attached to the casing 14 surrounds an opening to the cavity 16. A passage 22 is formed through the ear coupling 20 and communicates with the opening to the cavity 16. In some examples, an outer microphone 24 is disposed on the casing in a manner that permits acoustic coupling to the environment external to the casing.

In implementations that include ANR, the inner microphone 18 may be a feedback microphone and the outer microphone 24 may be a feedforward microphone. In such implementations, each earpiece 12 includes an ANR circuit 26 that is in communication with the inner and outer microphones 18 and 24. The ANR circuit 26 receives an inner signal generated by the inner microphone 18 and an outer signal generated by the outer microphone 24, and performs an ANR process for the corresponding earpiece 12. The process includes providing a signal to an electroacoustic transducer (e.g., speaker) 28 disposed in the cavity 16 to generate an anti-noise acoustic signal that reduces or substantially prevents sound from one or more acoustic noise sources that are external to the earpiece 12 from being heard by the user. As described herein, in addition to providing an anti-noise acoustic signal, electroacoustic transducer 28 can utilize its sound-radiating surface for providing an audio output for playback, e.g., for a continuous audio feed.
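
By way of a non-limiting illustration only, a highly simplified feedforward ANR stage can be sketched in a few lines of Python (hypothetical names, with a fixed FIR filter standing in for whatever adaptive filtering an actual ANR circuit 26 performs on dedicated hardware):

    import numpy as np

    def feedforward_anti_noise(outer_mic: np.ndarray, ff_filter: np.ndarray) -> np.ndarray:
        """Predict the noise reaching the ear from the outer (feedforward)
        microphone signal and return an inverted anti-noise signal."""
        # Model the acoustic path from outer microphone 24 to the eardrum with
        # a fixed FIR filter, then invert the estimate so that playback of the
        # result through transducer 28 cancels the predicted noise.
        predicted_noise = np.convolve(outer_mic, ff_filter, mode="same")
        return -predicted_noise

    def earpiece_signal(playback: np.ndarray, outer_mic: np.ndarray,
                        ff_filter: np.ndarray) -> np.ndarray:
        # Transducer 28 renders the playback audio plus the anti-noise signal.
        return playback + feedforward_anti_noise(outer_mic, ff_filter)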

A control circuit 30 is in communication with the inner microphones 18, outer microphones 24, and electroacoustic transducers 28, and receives the inner and/or outer microphone signals. In certain examples, the control circuit 30 includes a microcontroller or processor having a digital signal processor (DSP), and the inner signals from the two inner microphones 18 and/or the outer signals from the two outer microphones 24 are converted to digital format by analog to digital converters. In response to the received inner and/or outer microphone signals, the control circuit 30 can take various actions. For example, audio playback may be initiated, paused or resumed, a notification to a wearer may be provided or altered, and a device in communication with the personal audio device may be controlled. The personal audio device 10 also includes a power source 32. The control circuit 30 and power source 32 may be in one or both of the earpieces 12 or may be in a separate housing in communication with the earpieces 12. The personal audio device 10 may also include a network interface 34 to provide communication between the personal audio device 10 and one or more audio sources and other personal audio devices. The network interface 34 may be wired (e.g., Ethernet) or wireless (e.g., employ a wireless communication protocol such as IEEE 802.11, Bluetooth, Bluetooth Low Energy, or other local area network (LAN) or personal area network (PAN) protocols).

Network interface 34 is shown in phantom, as portions of the interface 34 may be located remotely from personal audio device 10. The network interface 34 can provide for communication between the personal audio device 10, audio sources and/or other networked (e.g., wireless) speaker packages and/or other audio playback devices via one or more communications protocols. The network interface 34 may provide either or both of a wireless interface and a wired interface. The wireless interface can allow the personal audio device 10 to communicate wirelessly with other devices in accordance with any communication protocol noted herein. In some particular cases, a wired interface can be used to provide network interface functions via a wired (e.g., Ethernet) connection.

In some cases, the network interface 34 may also include a network media processor for supporting, e.g., Apple AirPlay® (a proprietary protocol stack/suite developed by Apple Inc., with headquarters in Cupertino, Calif., that allows wireless streaming of audio, video, and photos, together with related metadata between devices), other known wireless streaming services (e.g., an Internet music service such as Pandora®, a radio station provided by Pandora Media, Inc. of Oakland, Calif., USA; Spotify®, provided by Spotify USA, Inc. of New York, N.Y., USA; or vTuner®, provided by vTuner.com of New York, N.Y., USA), or network-attached storage (NAS) devices. For example, if a user connects an AirPlay® enabled device, such as an iPhone or iPad device, to the network, the user can then stream music to the network connected audio playback devices via Apple AirPlay®. Notably, the audio playback device can support audio-streaming via AirPlay® and/or DLNA's UPnP protocols, all integrated within one device. Other digital audio coming from network packets may pass straight from the network media processor (e.g., through a USB bridge) to the control circuit 30. As noted herein, in some cases, control circuit 30 can include a processor and/or microcontroller, which can include decoders, DSP hardware/software, etc. for playing back (rendering) audio content at electroacoustic transducers 28. In some cases, network interface 34 can also include Bluetooth circuitry for Bluetooth applications (e.g., for wireless communication with a Bluetooth enabled audio source such as a smartphone or tablet). In operation, streamed data can pass from the network interface 34 to the control circuit 30, including the processor or microcontroller. The control circuit 30 can execute instructions (e.g., for performing, among other things, digital signal processing, decoding, and equalization functions), including instructions stored in a corresponding memory (which may be internal to control circuit 30 or accessible via network interface 34 or other network connection, e.g., a cloud-based connection). The control circuit 30 may be implemented as a chipset of chips that include separate and multiple analog and digital processors. The control circuit 30 may provide, for example, for coordination of other components of the personal audio device 10, such as control of user interfaces (not shown) and applications run by the personal audio device 10.

In addition to a processor and/or microcontroller, control circuit 30 can also include one or more digital-to-analog (D/A) converters for converting the digital audio signal to an analog audio signal. This audio hardware can also include one or more amplifiers which provide amplified analog audio signals to the electroacoustic transducer(s) 28, which each include a sound-radiating surface for providing an audio output for playback. In addition, the audio hardware may include circuitry for processing analog input signals to provide digital audio signals for sharing with other devices.

The memory in control circuit 30 can include, for example, flash memory and/or non-volatile random access memory (NVRAM). In some implementations, instructions (e.g., software) are stored in an information carrier. The instructions, when executed by one or more processing devices (e.g., the processor or microcontroller in control circuit 30), perform one or more processes, such as those described elsewhere herein. The instructions can also be stored by one or more storage devices, such as one or more (e.g., non-transitory) computer- or machine-readable mediums (for example, the memory, or memory on the processor/microcontroller). As described herein, the control circuit 30 (e.g., memory, or memory on the processor/microcontroller) can include a control system including instructions for controlling adaptive audio functions according to various particular implementations. It is understood that portions of the control system (e.g., instructions) could also be stored in a remote location or in a distributed location, and could be fetched or otherwise obtained by the control circuit 30 (e.g., via any communications protocol described herein) for execution. The instructions may include instructions for controlling adaptive audio processes (i.e., the software modules include logic for processing inputs from a user and/or sensor system to manage audio streams), as well as digital signal processing and equalization. Additional details may be found in U.S. Patent Application Publication 2014/0277644, U.S. Patent Application Publication 2017/0098466, U.S. Patent Application Publication 2014/0277639, and U.S. Provisional Patent Application Ser. No. 62/538,853 (“Conversational Audio Assistant”), the disclosures of which are incorporated herein by reference in their entirety.

Personal audio device 10 can also include a sensor system 36 coupled with control circuit 30 for detecting one or more conditions of the environment proximate personal audio device 10. Sensor system 36 can include one or more local sensors (e.g., inner microphones 18 and/or outer microphones 24) and/or remote or otherwise wirelessly (or hard-wired) connected sensors for detecting conditions of the environment proximate personal audio device 10 as described herein. As described further herein, sensor system 36 can include a plurality of distinct sensor types for detecting environmental conditions proximate the personal audio device 10.

According to various implementations, the audio playback devices (which may be, for example, personal audio device 10 of FIG. 1) described herein can be configured to provide adaptive audio outputs based upon one or more environmental factors. These particular implementations can allow a user to experience continuous, contextual audio content throughout changing environments. These implementations can enhance the user experience in comparison to conventional audio systems, e.g., portable audio systems or audio systems spanning distinct environments.

As described with respect to FIG. 1, control circuit 30 can execute (and in some cases store) instructions for controlling adaptive audio functions in personal audio device 10 and/or other audio playback devices in a network of such devices. As shown in FIG. 2, control circuit 30 can include an adaptive audio engine 210 configured to implement modifications in audio outputs at the transducer (e.g., speaker) 28 (FIG. 1) in response to a change in environmental conditions. In various particular embodiments, adaptive audio engine 210 is configured to receive data about an environmental condition from sensor system 36, and modify the audio output at transducer(s) 28 in response to a change in the environmental condition. In particular implementations, the audio output includes a continuous audio output provided across a transition between environmental conditions, which is configured to vary with the change(s) in environmental condition.

In particular, FIG. 2 shows a schematic data flow diagram illustrating a control process performed by adaptive audio engine 210 in connection with a user 225. It is understood that in various implementations, user 225 can include a human user. FIG. 3 shows a process flow diagram illustrating processes performed by adaptive audio engine 210 according to various implementations. FIG. 4 shows a schematic depiction of a user 225 in distinct environments 400a, 400b, 400c, etc. to illustrate example operations of adaptive audio engine 210 and personal audio device 10. FIGS. 1-4 are referred to simultaneously, with particular emphasis on FIGS. 2-4.

Returning to FIG. 2, data flows between adaptive audio engine 210 and other components in personal audio device 10 are shown. It is understood that one or more components shown in the data flow diagram may be integrated in the same physical housing, e.g., in the housing of personal audio device 10, or may reside in one or more separate physical locations.

According to various implementations, control circuit 30 includes the adaptive audio engine 210, or otherwise accesses program code for executing processes performed by adaptive audio engine 210 (e.g., via network interface 34). Adaptive audio engine 210 can include logic for processing environmental data 230 from sensor system 36 and providing a continuous audio output (i.e., audio stream or feed) 235 to user 225 that varies across distinct environments 400a, 400b, 400c, etc. (FIG. 4). In some cases, this logic can include environmental data processing logic 240, library lookup logic 245 and feedback logic 250.
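
As a non-limiting illustration of how these three logic components might divide the work, consider the following Python skeleton (hypothetical names and trivial bodies; the disclosure does not prescribe any particular implementation):

    class AdaptiveAudioEngine:
        """Illustrative skeleton of adaptive audio engine 210."""

        def __init__(self, audio_library, transducer):
            self.audio_library = audio_library  # stands in for audio library 255
            self.transducer = transducer        # stands in for transducer 28

        def process_environment(self, environmental_data):
            # Environmental data processing logic 240: condense raw sensor
            # readings (environmental data 230) into a weighted representation.
            return {name: float(value) for name, value in environmental_data.items()}

        def lookup(self, representation):
            # Library lookup logic 245: map the representation to an audio file.
            return max(self.audio_library,
                       key=lambda f: sum(representation.get(tag, 0.0) for tag in f["tags"]))

        def on_environment_change(self, environmental_data):
            # The continuous audio output 235 varies with the detected change.
            self.transducer.play(self.lookup(self.process_environment(environmental_data)))

        def process_feedback(self, feedback):
            # Feedback logic 250: learn from user responses (process 330, FIG. 3).
            pass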

Adaptive audio engine 210 can be coupled (e.g., wirelessly and/or via hardwired connections in personal audio device 10) with an audio library 255, which can include audio files 260 for playback (e.g., streaming) at personal audio device 10 and/or a profile system 265 including user profiles 270 about one or more user(s) 225. Audio library 255 can include any library associated with digital audio sources accessible via network interface 34 (FIG. 1) described herein, including locally stored, remotely stored or Internet-based audio libraries. User profiles 270 may be user-specific, community-specific, device-specific, location-specific or otherwise associated with a particular entity such as user 225. User profiles 270 can include user-defined playlists of digital music files available from network audio sources coupled with network interface 34 (FIG. 1), such as network-attached storage (NAS) devices, and/or a DLNA server, which may be accessible to the personal audio device 10 (FIG. 1) over a local area network such as a wireless (Wi-Fi) or wired (Ethernet) home network, as well as Internet music services such as Pandora®, vTuner®, Spotify®, etc., which are accessible to the personal audio device 10 over a wide area network such as the Internet. In some cases, profile system 265 is located in a local server or a cloud-based server, similar to any such server described herein. User profile 270 may include information about frequently played audio files associated with user 225 or other similar users (e.g., those with common audio file listening histories, demographic traits or Internet browsing histories), “liked” or otherwise favored audio files associated with user 225 or other similar users, frequency with which particular audio files are changed by user 225 or other similar users, etc. Profile system 265 can be associated with any community of users, e.g., a social network or subscription-based music service (such as a service providing audio library 255), and may include audio preferences, histories, etc. for user 225 as well as a plurality of other users.

Adaptive audio engine 210 can also be coupled with a smart device 275 that has access to a user profile (e.g., profile 270) or biometric information about user 225. It is understood that smart device 275 can include one or more personal computing devices (e.g., desktop or laptop computer), wearable smart devices (e.g., smart watch, smart glasses), a smart phone, a remote control device, a smart beacon device (e.g., smart Bluetooth beacon system), a stationary speaker system, etc. Smart device 275 can include a conventional user interface for permitting interaction with user 225, and can include one or more network interfaces for interacting with control circuit 30 and other components in personal audio device 10 (FIG. 1). In some example implementations, smart device 275 can be utilized for: connecting personal audio device 10 to a Wi-Fi network; creating a system account for the user 225; setting up music services; browsing of content for playback; setting preset assignments on the personal audio device 10 or other audio playback devices; transport control (e.g., play/pause, fast forward/rewind, etc.) for the personal audio device 10; and selecting one or more personal audio devices 10 for content playback (e.g., single room playback or synchronized multi-room playback). In some cases smart device 275 may also be used for: music services setup; browsing of content; setting preset assignments on the audio playback devices; transport control of the audio playback devices; and selecting personal audio devices 10 (or other playback devices) for content playback. Smart device 275 can further include embedded sensors for measuring biometric information about user 225, e.g., travel, sleep or exercise patterns; body temperature; heart rate; or pace of gait (e.g., via accelerometer(s)).

Adaptive audio engine 210 is configured to receive environmental data 230 about distinct environments 400a, 400b, 400c, etc. (FIG. 4) from sensor system 36. Environmental data 230 is described herein with reference to the various forms of sensor system 36 configured for sensing such data.

As shown in FIG. 2, sensor system 36 can include one or more of the following sensors 280: a position tracking system 282; an accelerometer/gyroscope 284; a microphone (e.g., including one or more microphones) 286 (which may include or work in concert with microphones 18 and/or 24); and a wireless transceiver 288. These sensors are merely examples of sensor types that may be employed according to various implementations. It is further understood that sensor system 36 can deploy these sensors in distinct locations and distinct sub-components in order to detect particular environmental information relevant to user 225.

The position tracking system 282 can include one or more location-based detection systems such as a global positioning system (GPS) location system, a Wi-Fi location system, an infra-red location system, a Bluetooth beacon system, etc. Position tracking system 282 can be configured to detect changes in the physical location of the personal audio device 10 and/or user 225 (where user 225 is separated from personal audio device 10) and provide updated environmental data 230 to the adaptive audio engine 210 in order to indicate a change in the environment 400a, 400b, 400c, etc. proximate user 225, e.g., a change in location of the user 225 or a change in a condition proximate user 225. In some example implementations, this position tracking system 282 can detect that user 225 has moved from a rural location to an urban location, or that user 225 has walked from a city street into his/her office, or that user 225 has reached a location proximate to a location of interest (e.g., a landmark, a location linked with the calendar of user 225 or profile 270 of user 225, or a location previously visited by user 225, friends of user 225 or other similar users in a social network).
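
As one non-limiting example, the proximity-to-a-location-of-interest test could reduce to a great-circle distance check on two coordinate fixes (hypothetical names and radius; the actual behavior of position tracking system 282 is not specified at this level):

    import math

    def distance_m(lat1, lon1, lat2, lon2):
        """Haversine (great-circle) distance between two fixes, in meters."""
        r = 6371000.0  # mean Earth radius in meters
        p1, p2 = math.radians(lat1), math.radians(lat2)
        dp, dl = math.radians(lat2 - lat1), math.radians(lon2 - lon1)
        a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
        return 2 * r * math.asin(math.sqrt(a))

    def near_location_of_interest(device_fix, interest_fix, radius_m=100.0):
        # device_fix and interest_fix are (latitude, longitude) tuples.
        return distance_m(*device_fix, *interest_fix) <= radius_m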

The accelerometer/gyroscope 284 can include distinct accelerometer components and gyroscope components, or could be collectively housed in a single sensor component. This component may be used to sense gestures based on movement of the user's body (e.g., head, torso, limbs) while the user is wearing the personal audio device 10 or interacting with another device (e.g., smart device 275) connected with personal audio device 10. As with any sensor in sensor system 36, accelerometer/gyroscope 284 may be housed within personal audio device 10 or in another device connected to the personal audio device 10. In some example implementations, the accelerometer/gyroscope 284 can detect an acceleration of the user 225 and/or personal audio device 10 or a deceleration of the user 225 and/or personal audio device 10.

The microphone 286 (which can include one or more microphones, or a microphone array) can have similar functionality as the microphone(s) 18 and 24 shown and described with respect to FIG. 1, and may be housed within personal audio device 10 or in another device connected to the personal audio device 10. As noted herein, microphone 286 may include or otherwise utilize microphones 18 and 24 to perform functions described herein. Microphone 286 can be positioned to receive ambient audio signals (e.g., audio signals proximate personal audio device 10). In some cases, these ambient audio signals include speech/voice input from user 225 to enable voice control functionality. In some other example implementations, the microphone 286 can detect the voice of user 225 and/or of other users proximate to or interacting with user 225. In particular implementations, adaptive audio engine 210 is configured to analyze one or more voice commands from user 225 (via microphone 286) for a speech pattern, and modify audio output 235 based upon that identified speech pattern.

For example, environmental data processing logic 240 in adaptive audio engine 210 can include logic for analyzing vocal patterns or voice-to-text pattern recognition (e.g., natural language processing (NLP) logic or other similar logic) for detecting a speech pattern in the voice of user 225. In these cases, adaptive audio engine 210 can modify audio output 235 to provide a match or best-match for the user's speech pattern, e.g., where an audio prompt or other command is provided to user 225, and that audio output 235 can be presented to the user 225 in a similar speech pattern as the pattern detected at microphone 286. In particular implementations, where the user has a particular dialect, vocabulary or language preference, adaptive audio engine 210 can modify the audio output to match that speech pattern.

Returning to sensor system 36, wireless transceiver 288 (comprising a transmitter and a receiver) can include, for example, a Bluetooth (BT) or Bluetooth Low Energy (BTLE) transceiver or other conventional transceiver device, and may be configured to communicate with other transceiver devices in distinct environments 400a, 400b, 400c, etc. (FIG. 4). In some example implementations, wireless transceiver 288 can be configured to detect an audio cache proximate personal audio device 10, e.g., in a local network at a geographic location or in a cloud storage system connected with the geographic location. For example, another user, a business establishment, government entity, tour group, etc. could leave an audio cache (e.g., a song or pre-recorded message) at particular geographic (or virtual) locations, and wireless transceiver 288 can be configured to detect this cache and prompt user 225 regarding playback of the cache file.

It is understood that any number of additional sensors 290 could be incorporated in sensor system 36, and could include temperature sensors or humidity sensors for detecting changes in weather within environments, optical/laser-based sensors and/or vision systems for tracking movement or speed, light sensors for detecting time of day, additional audio sensors (e.g., microphones) for detecting human or other user speech or ambient noise, etc.

As noted herein, adaptive audio engine 210 can include logic for performing audio control functions according to various implementations. FIG. 3 shows a flow diagram illustrating processes in adaptive audio control performed by adaptive audio engine 210 and its associated logic. FIG. 4 illustrates a single user 225 in distinct environments 400a, 400b, 400c and 400d, which may include distinct physical locations, or may include the same physical location with a distinct environmental condition. For example, environment 400a could include the interior of a user's office (e.g., sitting at his/her desk), while environment 400b could include the interior of that same user's automobile (e.g., sitting in the driver's seat). Environment 400c could include the same physical location as environment 400a (the user's office), but could be distinct in terms of the ambient audio at that location. For example, sensor system 36 (e.g., microphone 286) could detect the voice of user 225 and another user and determine that a conversation is ongoing in the office. This environmental condition (represented as environmental data 230) can mark a change from environment 400a to environment 400c, while not changing the physical location of user 225 and/or personal audio device 10. FIG. 3 is referred to concurrently with FIG. 2.

Turning to the process flow in FIG. 3, adaptive audio engine 210 receives environmental data 230 from sensor system 36 (process 310), and modifies the continuous audio output 235 at transducer 28 (in personal audio device 10) in response to a change in the environmental data 230 (process 320). As noted herein, the audio output 235 includes a continuous audio output provided across a transition between environmental conditions and is configured to vary with the detected change in environmental condition (e.g., changes from environment 400a to environment 400b and then to environment 400c).

Environmental data 230 can include data about one or more environmental conditions detected by sensor system 36, and may include data about a plurality of environmental conditions. For example, environmental data 230 could include data about a position of the personal audio device 10 (e.g., from position tracking system 282), data about an acceleration of personal audio device 10 (e.g., from accelerometer/gyroscope 284), data about the ambient audio conditions proximate personal audio device 10 (e.g., from microphone 286) or data about nearby audio, video or other data caches (e.g., from wireless transceiver 288).

Returning to FIG. 2, in various implementations, environmental data processing logic 240 is configured to process the environmental data 230 and provide a weighted environmental representation to library lookup logic 245 to enable fetching a type of audio file 260 for providing in the continuous audio output 235. That is, environmental data processing logic 240 can include weightings or factoring for one or more of: user preferences (e.g., user profile(s) 270), sensor data about past events (e.g., position and/or acceleration information about personal audio device 10 over given periods), audio files (e.g., audio samples of users' voices, as sampled by microphone 286), and other readily available data (e.g., a demographic profile of a plurality of users with at least one common attribute with the user or a categorical popularity of an audio file 260). The weighted environmental representation may indicate a general characteristic of the environment 400a, 400b, 400c, etc., as a combination of factors from environmental (sensor) data 230, profile(s) 270 and/or information from smart device 275.
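
One minimal way to picture such a weighted environmental representation is a map of normalized features scaled by per-feature weights (hypothetical feature names and weights; the disclosure leaves the weighting scheme open):

    def weighted_representation(env_data, profile_hints, weights):
        """Combine sensor readings and profile hints into one weighted feature map.

        All arguments are plain dicts, e.g.:
        env_data      = {"pace": 0.8, "ambient_noise": 0.4}
        profile_hints = {"up_tempo_preference": 0.6}
        weights       = {"pace": 2.0, "ambient_noise": 1.0}
        """
        features = {**env_data, **profile_hints}
        return {name: weights.get(name, 1.0) * value for name, value in features.items()}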

After processing the environmental data 230 with environmental data processing logic 240, library lookup logic 245 can search audio library 255 for files 260 using the weighted environmental representation from environmental data processing logic 240. Library lookup logic 245 may include a relational database with relationships between the weighted environmental representation and audio files 260. As noted herein, audio library 255 can be locally stored at personal audio device 10 (FIG. 1) and/or stored at one or more remote or cloud-based servers. Library lookup logic 245 can be continually updated based upon changes in audio library 255 in order to provide accurate, timely associations between the weighted environmental representation from environmental data processing logic 240 and audio files 260.
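
A toy version of such a lookup might score each file's tags against the weighted environmental representation and return the best match (a sketch with a hypothetical schema; the disclosure describes the lookup only as a relational database of associations):

    def lookup_audio_file(representation, audio_library):
        """Return the library entry whose tags best match the representation.

        audio_library is a list of dicts such as:
        {"title": "...", "tags": {"pace": 0.9, "ambient_noise": 0.2}}
        """
        def score(entry):
            # Dot product of the weighted representation with the file's tags;
            # tags absent from either side contribute nothing.
            return sum(value * entry["tags"].get(name, 0.0)
                       for name, value in representation.items())
        return max(audio_library, key=score)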

In some example implementations, adaptive audio engine 210 (e.g., using environmental data processing logic 240 and/or library lookup logic 245) is configured to perform one or more of the following logic processes using environmental data 230 and/or other data accessible via profile system 265, smart device 275, etc.: speech recognition, speaker identification, speaker verification, word spotting (e.g., wake word detection), speech end pointing (e.g., end of speech detection), speech segmentation (e.g., sentence boundary detection or other types of phrase segmentation), speaker diarization, affective emotion classification on voice, acoustic event detection, two-dimensional (2D) or three-dimensional (3D) beam forming, source proximity/location, volume level readings, acoustic saliency maps, ambient noise level data collection, signal quality self-check, gender identification (ID), age ID, echo cancellation/barge-in/ducking, language identification, and/or other environmental classification such as environment type (e.g., small room, large room, crowded street, etc.; and quiet or loud).

After library lookup logic 245 selects the audio file 260, that audio stream is provided (i.e., rendered) at transducer 28 (FIG. 1) as part of continuous audio output 235. In some examples, audio output 235 is configured to transition between audio files 260 in a fade-out/fade-in format, but in some other implementations, the transition between audio files 260 can be abrupt. In other implementations, the audio output 235 can include an introduction to an audio file 260 (e.g., the beginning of a song or other audio feed (e.g., a sporting event broadcast, an audio file of a movie, interview, documentary, educational programming, etc.), or a portion of such a song or other audio file 260 selected to represent that song or audio file 260 (e.g., the chorus of a song)).
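
For the fade-out/fade-in format, an equal-power crossfade over the overlapping samples is one standard technique; a minimal sketch follows, assuming both files are decoded to sample arrays at the same rate and are longer than the fade window:

    import numpy as np

    def crossfade(outgoing: np.ndarray, incoming: np.ndarray,
                  sample_rate: int, seconds: float = 2.0) -> np.ndarray:
        """Equal-power crossfade from the end of `outgoing` into `incoming`."""
        n = int(sample_rate * seconds)
        t = np.linspace(0.0, np.pi / 2, n)
        fade_out, fade_in = np.cos(t), np.sin(t)  # 1 -> 0 and 0 -> 1
        overlap = outgoing[-n:] * fade_out + incoming[:n] * fade_in
        return np.concatenate([outgoing[:-n], overlap, incoming[n:]])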

Additionally, the adaptive audio engine 210 is configured to provide a prompt 292 for feedback about the audio output 235 along with, prior to, or after the transition between audio files 260. The prompt 292 can include any prompt described herein, such as via a user interface or other interface (e.g., a user input interface on smart device 275), or may include an audio prompt (provided via speaker 28, with response capability via microphone 286 or microphones 18 or 24 (FIG. 1)) for feedback about the transition between audio files 260 in continuous audio output 235.

In various embodiments, where prompt 292 is an audio prompt, it can include a phrase such as “Did you enjoy this transition?”, “Would you like to continue this audio stream?”, or “Would you like to hear this audio cache stored by your friend at this location?” Prompt 292 can include any spoken phrase, word or clause intended to elicit a response from user 225, or can include a displayed prompt (e.g., similar to the audio phrase, or with a Yes/No/Maybe or other visual prompt with touch-screen or push-button response capabilities), such as a prompt displayed at smart device 275 or other device within range of user 225. In various implementations, prompt 292 can be provided to the user 225 without an intervening audio input from user 225, such that user 225 is not required to prompt adaptive audio engine 210 (e.g., by using an initiation term such as a name) in order to provide feedback. That is, the control circuit 30 can maintain the microphone(s) 286 in a query mode during the transition in audio files 260, such that the system actively awaits a response from the user 225. In some implementations, microphone(s) 286 can remain in an optional response mode while providing the audio output 235 and/or prompt 292. That is, control circuit 30 can maintain microphone(s) 286 in a listen mode for a set period, with an expectation that user 225 may or may not respond (e.g., with a “Thanks,” compliment or other feedback about audio output 235 and/or prompt 292).

In various implementations, the audio output 235 is provided through the at least one speaker 28 (FIG. 1) at a decibel level ranging between approximately 50-70 decibels (dB). That is, audio output 235 can be provided to user 225 at a decibel level range which falls within the spectrum of “background” sound relative to the ambient sound in environments 400a, 400b, 400c, etc. In particular implementations, audio output 235 is intended to act as a “soundtrack” for user 225 as he/she travels through differing environments 400a, 400b, 400c, etc. In this sense, audio output 235, which may be rendered at personal audio device 10 and/or other playback devices described herein, is provided to user 225 as he/she walks/drives/runs/bicycles through daily activities. In various particular cases, user 225 experiences a changing audio output 235 (e.g., audio stream) in a seamless manner between distinct environments 400a, 400b, 400c, etc. However, due to the varying nature of environments 400a, 400b, 400c, etc., audio output 235 may be set at a decibel level such that it does not interfere with audio within each particular environment.

Additionally, adaptive audio engine 210 can modify the decibel level or intensity of the audio output 235 in response to detecting a change in one or more environmental conditions from sensor system 36, e.g., by reducing the decibel level or intensity level (e.g., type of audio file 260) when detecting the user's voice or another voice at microphone 286, by increasing the decibel level (or increasing intensity by switching to an up-tempo audio file 260) when accelerometer/gyroscope 284 and/or position tracking system 282 indicate user 225 is moving at a particularly fast pace, or by reducing the decibel level when providing a prompt 292 (e.g., “Sue has left an audio cache at this location, would you like to hear it?”, “Mary and Steve listened to this song the last time they visited this restaurant, would you like to hear it?”, or “Your location indicates you are traveling to Boston, would you like to listen to an audio introduction to the city's popular sights?”).
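
These adjustments amount to a mapping from detected conditions to a target output level; a simplified rule-based sketch is shown below (hypothetical thresholds, offered only to make the behavior concrete):

    def target_output_level_db(base_db: float, speech_detected: bool,
                               pace_m_s: float, prompt_active: bool) -> float:
        """Pick a playback level within the approximately 50-70 dB band."""
        level = base_db
        if speech_detected or prompt_active:
            level -= 10.0        # duck under conversation or while prompt 292 plays
        elif pace_m_s > 2.5:     # user 225 moving at a particularly fast pace
            level += 5.0
        return max(50.0, min(70.0, level))  # clamp to the approximate range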

In some cases, prompt 292 is provided at a decibel level approximately equal to or greater than that of audio output 235, in order to enhance the chances that user 225 recognizes the prompt 292 and provides useful feedback. In some particular embodiments, audio output 235 is provided (i.e., rendered) at transducer 28 (FIG. 1) at a first decibel level, then the audio output 235 is faded out to a reduced decibel level while prompt 292 is provided, and audio output 235 is then faded in to the original decibel level while adaptive audio engine 210 awaits a response to prompt 292 at microphone 286 or other interface. In various embodiments, the prompt 292 can trail the transition in audio files 260 in audio output 235 or overlap with only a portion of the new audio file 260 to allow user 225 to consider the new audio file 260 prior to providing feedback.

With continuing reference to FIGS. 2-4, according to various implementations, adaptive audio engine 210 can further include feedback logic 250 for receiving and processing feedback 294 from user 225 (process 330). In some cases, feedback 294 includes negative feedback from user 225 about the change in audio output 235 (e.g., “I don't like this song”, “No”, “Change”, “Next” or a similar response to a user interface prompt such as a thumbs-down, “No” vote, etc.). In other cases, feedback 294 includes positive feedback from user 225 about the change in audio output 235 (e.g., “Yes”, “Good”, “Continue” or a similar response to a user interface prompt such as a thumbs-up, “Yes” vote, etc.). In various implementations, user 225 can provide either a verbal response or a response to a user interface prompt.

In some cases, feedback 294, or other audio inputs (e.g., environmental data 230), includes an audio signal, and may be analyzed using acoustic feature extraction for one or more features including: energy, zero-crossing rate, mel-frequency cepstral coefficients, spectral flatness, summary statistics (e.g., mean, variance, skew or kurtosis) on any signal measurement, tempo/beats-per-minute and acoustic fingerprinting. In some cases, audio files 260 can include “text” metadata, which can allow adaptive audio engine 210 to perform metadata feature extraction on audio file(s) 260. This metadata feature extraction can include, for example, matching and linking features to a database (e.g., audio library 255) and/or retrieving/analyzing additional audio and semantic attributes of the audio file(s) 260, e.g., genre, mood, themes or related artists. Adaptive audio engine 210 (and logic therein) can use these acoustic features from feedback 294 or other audio inputs, and metadata features from audio files 260, to perform statistical and probabilistic modeling in order to recommend other similar audio file(s) 260 and/or recommend audio streams (e.g., radio stations, albums, playlists or artists).
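
Several of the listed features are straightforward to compute per signal frame; a minimal NumPy sketch follows (a real system would use windowed frames and more robust estimators):

    import numpy as np

    def acoustic_features(frame: np.ndarray, eps: float = 1e-12) -> dict:
        """Compute a handful of the acoustic features named above for one frame."""
        energy = float(np.mean(frame ** 2))
        # Zero-crossing rate: fraction of adjacent samples whose sign differs.
        zcr = float(np.mean(np.abs(np.diff(np.signbit(frame).astype(int)))))
        # Spectral flatness: geometric over arithmetic mean of the power
        # spectrum (near 1 for noise-like, near 0 for tonal content).
        power = np.abs(np.fft.rfft(frame)) ** 2 + eps
        flatness = float(np.exp(np.mean(np.log(power))) / np.mean(power))
        return {"energy": energy, "zero_crossing_rate": zcr,
                "spectral_flatness": flatness,
                "mean": float(np.mean(frame)), "variance": float(np.var(frame))}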

In response to the user 225 providing negative feedback about the new audio file 260 in audio output 235, adaptive audio engine 210 is configured to either provide an additional new file 260 to user 225 along with an additional prompt 292 for feedback (process 340, FIG. 3), or continue to play the previous audio file 260 (process 350, FIG. 3). This decision can be made by adaptive audio engine 210 based upon the user profile 270 associated with user 225 or with other user settings. For example, as shown in FIG. 3, in some cases, where user 225 provides negative feedback about the modified audio output 235 (new audio file 260), adaptive audio engine 210 may check user profile 270 to determine whether user 225 has settings for playing an additional new audio file 260 (Set: New) in response to negative feedback, which may include an additional feedback prompt (process 340), or whether the profile 270 includes settings (Set: Continue) for continuing the previous audio file 260 in response to negative feedback (process 350). In some cases, a default setting could include playing the previous audio file (process 350).

In response to the user 225 providing positive feedback about the new audio file 260 in continuous audio output 235, adaptive audio engine 210 is configured to continue an audio feed of the new audio file 260 (process 360, FIG. 3). In these cases, the new audio file 260 may be modified to play from the beginning of the particular stream (e.g., the beginning of a song, where the sample was taken from a distinct portion of the song), or may continue to play through its entirety unless otherwise modified by the user 225. In various particular implementations, a positive response to the prompt 292 can include a null response (e.g., no verbal command from user 225) after a waiting period (e.g., 10 seconds, 20 seconds).
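
The decision flow of processes 340-360, including the null-response case, can be summarized in a short routine (illustrative names; "new" and "continue" mirror the Set: New and Set: Continue profile settings described above):

    def handle_feedback(feedback, profile, new_file, previous_file):
        """Apply processes 340/350/360 to a feedback 294 response.

        feedback is "positive", "negative", or None, where None means no
        response within the waiting period and is treated as positive.
        """
        if feedback in ("positive", None):
            return ("continue_new", new_file)         # process 360
        # Negative feedback: consult the settings in user profile 270.
        if profile.get("on_negative", "continue") == "new":
            return ("play_new_with_prompt", None)     # process 340
        return ("continue_previous", previous_file)   # process 350 (default)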

As shown in FIG. 2, feedback logic 250 is connected with environmental data processing logic 240 and can process feedback 294 and provide that feedback 294 to environmental data processing logic 240 and library lookup logic 245. In some cases, feedback logic 250 can be configured to teach environmental data processing logic 240 and library lookup logic 245 about preferences of user 225, e.g., where one or more of these logic components includes an artificial intelligence (AI) component for iteratively refining logic operations to enhance the accuracy of its results. Example AI components could include machine learning logic, a neural network (including an artificial neural network), a natural language processing engine, a deep learning engine, etc. In any case, feedback logic 250 can be configured to analyze feedback 294 and enhance future operations of adaptive audio engine 210. It is further understood that feedback logic 250, library lookup logic 245 and/or environmental data processing logic 240 may be interconnected in such a manner that these components act in concert or in reliance upon one another.

In some particular embodiments, as illustrated in FIG. 3, adaptive audio engine 210 is configured to provide an explanation for selection of the new audio file 260 in continuous audio output 235 in response to receiving either positive or negative feedback about the change in audio output 235 from user 225 (process 370, shown in phantom as optional). The explanation can include another audio stream, e.g., similar to prompt 292, and may include a phrase such as, “This song was selected because you liked another song by this artist,” or “Your friends listened to this song the last time they visited this location,” or “You listened to this soundtrack on your last jog down this route and you exceeded your previous best time.” In these cases, user 225 may elect particular explanation settings (e.g., in user profile 270 or in other settings) such that explanations do not overwhelm the experience, e.g., such that explanations are only provided in response to negative feedback, or that explanations are only provided after a plurality of negative feedback responses. Additionally, explanations could include additional prompts (or act as prompts themselves) to allow user 225 to verify that particular selection criteria are relevant to his/her interests. In various implementations, prompt(s) 292 (including explanations, suggestions, etc.) and feedback 294 can be exchanged between the user 225 and the adaptive audio engine 210 via real-time audio exchange (e.g., audio inputs/outputs via microphones and speakers), real-time exchange via smart device (e.g., via user interface(s) on one or more smart devices 275) or via these mechanisms at a later (delayed) time.

In some implementations, adaptive audio engine 210 is configured to work in concert with sensor system 36 to continually monitor changes in one or more environmental conditions. In some cases, sensor system 36 may be set in an active mode, such as where position tracking system 282 pings nearby Wi-Fi networks to triangulate the location of the personal audio device 10, or microphone 286 (or microphones 18 and/or 24) remains in a “listen” mode for particular ambient sounds. In other implementations, sensor system 36 and adaptive audio engine 210 can be configured in a passive mode, such as where wireless transceiver 288 detects signals transmitted from nearby transceiver devices or network devices. In still other implementations, distinct sensors in the sensor system 36 can be set in distinct modes for detecting changes in environmental conditions and transmitting updated environmental data 230 to adaptive audio engine 210. For example, some sensors in sensor system 36 can remain in an active mode while audio device 10 is active (e.g., powered on), while other sensors may remain in a passive mode for triggering by an event.
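
One way to picture the active/passive split is a loop that polls active sensors on a schedule while passive sensors deliver event callbacks (illustrative only; the disclosure does not mandate a particular scheduling model):

    import time

    class SensorSystemSketch:
        """Sketch of sensor system 36 with mixed active and passive sensors."""

        def __init__(self, active_sensors, on_data):
            self.active_sensors = active_sensors  # name -> zero-argument read function
            self.on_data = on_data                # callback into adaptive audio engine 210

        def on_passive_event(self, sensor_name, value):
            # Passive path: e.g., wireless transceiver 288 reporting a detected cache.
            self.on_data({sensor_name: value})

        def run(self, poll_interval_s=1.0, cycles=10):
            # Active path: sample each active sensor on a fixed schedule.
            for _ in range(cycles):
                self.on_data({name: read() for name, read in self.active_sensors.items()})
                time.sleep(poll_interval_s)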

As noted herein, in contrast to conventional audio systems, the personal audio device 10 disclosed according to various implementations can allow a user 225 to experience a continuous, contextual audio stream across a variety of environments 400a, 400b, 400c, etc. For example, as shown in FIG. 4, a user 225 can move between distinct environments 400a, 400b, 400c and 400d, or those environments may change in terms of one or more environmental conditions (e.g., time of day, ambient noise level, temperature) while the user 225 experiences a continuous audio output 235 that varies across a plurality of environments 400a, 400b, 400c (e.g., changes with a transition between environments or actively provides the option to transition between environments).

First Environment

In some implementations, environment 400a could include a live sporting event, such as a baseball game in the City of Boston. One or more sensors (e.g., position tracking system 282, FIG. 2) could detect that user 225 is physically located within a baseball park during a baseball game (e.g., where game-time data is pulled by additional sensors or via Internet search or calendar data via smart device 275). Adaptive audio engine 210 could select an audio file 260 that is appropriate for the baseball game, for example, an audio stream of the radio broadcast on a local sports network. As described herein, the audio output 235 of this file may be at a decibel level such that user 225 has the ability to hear the audio stream relative to the surrounding noise (e.g., as detected by microphone 286, or microphone(s) 18 or 24 (FIG. 1)) without overwhelming the live experience, that is, without interfering with the ambient audio at the baseball game. User 225 can pre-define this decibel level (e.g., in profile 270), or may adjust this decibel level dynamically using any control device described herein.

Second Environment

As the game concludes, user 225 walks out of the baseball park and into another environment 400b adjacent the baseball park (e.g., onto a street such as Yawkey Way). Sensor system 36 may detect (e.g., via position tracking system 282 and/or accelerometer/gyroscope 284) that user 225 is picking up pace and walking quickly, and send corresponding environmental data 230 to adaptive audio engine 210. The adaptive audio engine 210 may process this environmental data 230 and select a new audio file 260 for audio output 235, such as an up-tempo rock song by the Dropkick Murphys or the band Boston. As the user's pace increases, adaptive audio engine 210 could increase the decibel level of the audio output 235, e.g., based upon settings in user profile 270 or additional environmental data 230 such as the level of ambient noise as measured by microphone 286. Additionally, adaptive audio engine 210 could infer, based on the user's prior habits, that the user is walking towards his/her neighborhood, and select an audio playlist that is of an appropriate duration for the user's walk, so that the user can complete listening to the full playlist before entering the next environment.

Third Environment

As the user 225 leaves the area proximate the ballpark and enters a quiet residential area in environment 400c, sensor system 36 may detect (e.g., via position tracking system 282 and/or accelerometer/gyroscope 284) that user 225 is slowing down in pace, and that rain is starting to fall (e.g., via microphone 286, data from smart device 275 or additional sensors 290 such as humidity sensors), and send corresponding environmental data 230 to adaptive audio engine 210. The adaptive audio engine 210 may process this environmental data 230 and select a new audio file 260 for audio output 235 in environment 400c, such as a song about rain (e.g., “Who'll Stop The Rain” by Creedence Clearwater Revival) or a weather broadcast about current or future weather conditions in the area.

Fourth Environment

As the user 225 approaches his/her home and walks through the door in environment 400d, sensor system 36 may detect this location (e.g., via position tracking system 282, wireless transceiver 288, and/or additional sensors 290 detecting a home Wi-Fi network or proximity to other electronic devices known to be in the user's home) and send corresponding environmental data 230 to adaptive audio engine 210. The adaptive audio engine 210 may process this environmental data 230 and select a new audio file 260 for audio output 235 in environment 400d, such as a relaxing and/or familiar song liked by the user 225, e.g., from profile settings in profile 270 or frequently played via one or more streaming services connected with audio library 255 (e.g., “Take Me Home, Country Roads” by John Denver). In some particular implementations, adaptive audio engine 210 could transfer the audio output 235 to a different device, e.g., as the location of user 225 changes. In this example, when user 225 enters his/her home, the audio output 235 could be transitioned from a wearable device (e.g., headphone system) to a playback device located in the home (e.g., a speaker system).
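The device-to-device handoff mentioned above can be sketched as follows. This is a minimal, self-contained illustration in which the Player class, the HOME_SSIDS set, and the on_wifi_detected hook are hypothetical names rather than elements of the disclosed system; the key point shown is that the playback offset is carried across so the audio output 235 remains continuous.

    class Player:
        """Stand-in for any playback endpoint (headphones, home speaker)."""
        def __init__(self, name: str):
            self.name, self.position_s = name, 0.0

        def pause(self) -> float:
            print(f"{self.name}: paused at {self.position_s:.1f}s")
            return self.position_s

        def resume(self, position_s: float) -> None:
            self.position_s = position_s
            print(f"{self.name}: resuming at {position_s:.1f}s")

    HOME_SSIDS = {"home-network"}  # assumed SSIDs known to be the user's home

    def on_wifi_detected(ssid: str, wearable: Player, speaker: Player) -> None:
        """Transfer playback when a detected SSID indicates the home environment."""
        if ssid in HOME_SSIDS:
            speaker.resume(wearable.pause())

    wearable, speaker = Player("headphones"), Player("home speaker")
    wearable.position_s = 132.4
    on_wifi_detected("home-network", wearable, speaker)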

In this sense, user 225 traverses multiple environments with a continuous, contextual audio stream tailored to his/her preferences and the changing environments. Because the user 225 is not required to prompt adaptive audio engine 210 to modify the continuous audio stream, user 225 can focus on additional stimuli in the surrounding environments, and enjoy enhanced transitions between distinct environments.

The above-noted example process provides several location-based examples of changes in environments; however, as noted herein, environments (and associated environmental data 230) can change according to any measurable parameter that can be detected by sensor system 36. While particular types of sensors and modes of operation (e.g., active vs. passive detection) may be more practicable than others with given hardware constraints (e.g., in a personal audio device 10), it is understood that any sensor described herein can be employed by sensor system 36 to detect a change in environment around user 225.
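To make this parameter-agnostic point concrete, an environment change can be treated as any monitored parameter crossing a threshold, regardless of which sensor produced it. The sketch below is illustrative only; the parameter names and threshold values are assumptions.

    def environment_changed(previous: dict, current: dict, thresholds: dict) -> bool:
        """Flag a change when any shared numeric parameter in environmental
        data 230 moves by more than its configured threshold."""
        return any(
            key in previous and key in current
            and abs(current[key] - previous[key]) > limit
            for key, limit in thresholds.items()
        )

    prev = {"ambient_db": 62.0, "pace_m_per_s": 1.9, "humidity_pct": 40.0}
    curr = {"ambient_db": 48.0, "pace_m_per_s": 1.2, "humidity_pct": 85.0}
    print(environment_changed(prev, curr,
                              {"ambient_db": 10.0, "humidity_pct": 30.0}))  # True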

The functionality described herein, or portions thereof, and its various modifications (hereinafter “the functions”) can be implemented, at least in part, via a computer program product, e.g., a computer program tangibly embodied in an information carrier, such as one or more non-transitory machine-readable media, for execution by, or to control the operation of, one or more data processing apparatus, e.g., a programmable processor, a computer, multiple computers, and/or programmable logic components.

A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site, or distributed across multiple sites and interconnected by a network.

Actions associated with implementing all or part of the functions can be performed by one or more programmable processors executing one or more computer programs to perform the functions. All or part of the functions can be implemented as special purpose logic circuitry, e.g., an FPGA (field-programmable gate array) and/or an ASIC (application-specific integrated circuit). Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Components of a computer include a processor for executing instructions and one or more memory devices for storing instructions and data.

In various implementations, components described as being “coupled” to one another can be joined along one or more interfaces. In some implementations, these interfaces can include junctions between distinct components, and in other cases, these interfaces can include a solidly and/or integrally formed interconnection. That is, in some cases, components that are “coupled” to one another can be simultaneously formed to define a single continuous member. However, in other implementations, these coupled components can be formed as separate members and be subsequently joined through known processes (e.g., soldering, fastening, ultrasonic welding, bonding). In various implementations, electronic components described as being “coupled” can be linked via conventional hard-wired and/or wireless means such that these electronic components can communicate data with one another. Additionally, sub-components within a given component can be considered to be linked via conventional pathways, which may not necessarily be illustrated.

A number of implementations have been described. Nevertheless, it will be understood that additional modifications may be made without departing from the scope of the inventive concepts described herein, and, accordingly, other embodiments are within the scope of the following claims.

We claim:
1. A wearable audio device, comprising: an acoustic transducer having a sound-radiating surface for providing an audio output; at least one microphone configured to detect a characteristic of an ambient audio signal proximate the wearable audio device; and a control system coupled with the acoustic transducer and the at least one microphone, the control system configured to: receive data about the characteristic of the ambient audio signal from the at least one microphone; and modify the audio output at the acoustic transducer in response to a change in the characteristic of the ambient audio signal, wherein the audio output includes a continuous audio output provided across a transition between characteristics of the ambient audio signal and is configured to vary with the change in the characteristic of the ambient audio signal, wherein the continuous audio output comprises playback at the acoustic transducer of a plurality of audio files or audio streams that transition between distinct audio files or audio streams.
2. The wearable audio device of claim 1, wherein the ambient audio signal comprises a voice command from a user of the wearable audio device, and wherein the control system is configured to: analyze the voice command from the user for a speech pattern; and modify the continuous audio output based upon the speech pattern in the voice command from the user, wherein the control system comprises environmental data processing logic for analyzing vocal patterns in the voice of the user of the wearable audio device, or voice-to-text pattern recognition for detecting the speech pattern in the voice of the user.
3. The wearable audio device of claim 2, wherein modifying the continuous audio output based upon the speech pattern comprises modifying the continuous audio output to match the speech pattern in a voice of the user or provide a best-match for the speech pattern in the voice of the user.
4. The wearable audio device of claim 1, wherein the continuous audio output transitions between the distinct audio files or audio streams in response to detecting the change in the characteristic of the ambient audio signal.
5. The wearable audio device of claim 1, wherein the transition between distinct audio files or audio streams does not require a user interface command.
6. The wearable audio device of claim 1, wherein the change in the characteristic of the ambient audio signal occurs at a same physical location.
7. The wearable audio device of claim 1, wherein the control system is configured to determine that a conversation is ongoing in response to detecting at least one of: a) a voice of a user and a voice of another user in the ambient audio signal, or b) a close proximity or location of a voice of another user in the ambient audio signal.
8. The wearable audio device of claim 1, wherein the control system is configured to modify a decibel level or an intensity of the continuous audio output in response to detecting a voice of a user of the wearable audio device or a voice of another user in the ambient audio signal.
9. The wearable audio device of claim 1, wherein the continuous audio output is configured to vary in terms of at least one of: decibel level, intensity, echo cancellation, barge-in, ducking, or noise cancellation.
10. The wearable audio device of claim 1, wherein the continuous audio output is at a decibel level ranging between approximately 50-70 decibels (dB), wherein the control system is coupled with a smart device having access to a user profile or biometric information about a user, wherein the control system is further configured to modify the continuous audio output based upon the user profile or the biometric information about the user.
11. The wearable audio device of claim 1, wherein the control system is configured to detect a decibel level of the ambient audio signal and output the continuous audio output at a decibel level that permits a user of the wearable audio device to hear the ambient audio signal and the continuous audio output.
12. The wearable audio device of claim 1, wherein the control system is further configured to: receive feedback from a user of the wearable audio device after transitioning between the audio files or audio streams in the audio output; and in response to receiving negative feedback from the user about the transition between the audio files or audio streams, either: a) revert back to an immediately preceding audio file or audio stream, or b) transition to an additional new audio file or audio stream.
13. A computer-implemented method of controlling a wearable audio device configured to provide an audio output, the method comprising: receiving data about a characteristic of an ambient audio signal proximate the wearable audio device as detected by at least one microphone; and modifying the audio output at the wearable audio device in response to a change in the characteristic of the ambient audio signal, wherein the audio output includes a continuous audio output provided across a transition between detected changes in the characteristic of the ambient audio signal and is configured to vary with the change in the characteristic of the ambient audio signal, wherein the continuous audio output comprises playback at the wearable audio device of a plurality of audio files or audio streams that transition between distinct audio files or audio streams in response to detecting the change in the characteristic of the ambient audio signal.
14. The computer-implemented method of claim 13, wherein the ambient audio signal comprises a voice command from a user of the wearable audio device, and wherein the control system is configured to: analyze the voice command from the user for a speech pattern; and modify the continuous audio output based upon the speech pattern in the voice command from the user, wherein modifying the continuous audio output based upon the speech pattern comprises modifying the continuous audio output to match the speech pattern in the voice of the user or provide a best-match for the speech pattern in the voice of the user.
15. The computer-implemented method of claim 14, further comprising analyzing vocal patterns in the voice of the user of the wearable audio device, or using voice-to-text pattern recognition to detect the speech pattern in the voice of the user.
16. The computer-implemented method of claim 13, wherein the transition between distinct audio files or audio streams does not require a user interface command, and wherein the change in the characteristic of the ambient audio signal occurs at a same physical location.
17. The computer-implemented method of claim 13, further comprising determining that a conversation is ongoing in response to detecting at least one of: a) a voice of a user and a voice of another user in the ambient audio signal, or b) a close proximity or location of a voice of another user in the ambient audio signal.
18. The computer-implemented method of claim 13, wherein the control system is configured to modify a decibel level or an intensity of the continuous audio output in response to detecting a voice of a user of the wearable audio device or a voice of another user in the ambient audio signal, and wherein modifying the decibel level or the intensity of the continuous audio output comprises reducing the decibel level or the intensity of the continuous audio output.
19. The computer-implemented method of claim 13, wherein the continuous audio output is at a decibel level ranging between approximately 50-70 decibels (dB), wherein the method further includes: detecting a decibel level of the ambient audio signal; and outputting the continuous audio output at a decibel level that permits a user of the wearable audio device to hear the ambient audio signal and the continuous audio output.
20. The computer-implemented method of claim 13, further comprising: receiving feedback from a user of the wearable audio device after transitioning between the audio files or audio streams in the audio output; and in response to receiving negative feedback from the user about the transition between the audio files or audio streams, either: a) reverting back to an immediately preceding audio file or audio stream, or b) transitioning to an additional new audio file or audio stream.